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1 Introduction 



In the present paper, we study the recurrence of continuous-time Markov processes. More pre- 
cisely, we provide a criterion that yields a precise control of a subgeometric moment of the 
return-time to a test-set. The obtained result permits further quantitative analysis of charac- 
teristics such as the regularity of the process, the rate of convergence to the stationary state, 
and a moderate deviation principle. 

The stability and ergodic theory of continuous-time Markov processes has a large literature 
which is mainly devoted to the geometric case (also referred to as the exponential case). Meyn 
and Tweedie developed stability concepts for continuous-time Markov processes as well as sim- 
ple criteria for non-explosivity, non-evanescence, Harris-recurrence, positive Harris-recurrence, 
ergodicity and geometric ergodicity [21, 23, 24]. Of particular importance in actually applying 
these concepts is the existence of verifiable conditions. In the discrete-time context, development 
of Foster-Lyapunov type conditions on the transition kernel has provided such criteria (e.g. [22]). 
In the continuous-time context, Foster-Lyapunov inequalities applied to the generator of the pro- 
cess play the same role. These criteria were successfully applied to the study of the solution to 
stochastic differential equations (see [16] and more recently, [14] and references therein). Results 
relative to rates of convergence slower than geometric are not so well established. In [32, 20] 
(resp. [34]), polynomial and sub-exponential ergodicity of stochastic differential equations (resp. 
sub-exponential ergodicity of queuing models) are addressed, but these results are quite model- 
specific. Fort and Roberts [11] are, to our best knowledge, the first to study the subgeometric 
ergodicity of general strong Markov processes. Their conditions are in terms of subgeomet- 
ric moment of the return-time to a test-set. Fort and Roberts derive nested drift inequalities 
on the generator of the process that makes the result of practical interest in the polynomial case. 

One of the application of the condition we derive in the present paper makes the Fort- 
Roberts's theory applicable for more general subgeometric rates such as the logarithmic or the 
sub exponential ones. It also provides criteria for the (/, r)-regularity of a process, a characteristic 
which is an extension of the regularity concept [23]. We obtain theoretical results that are 
analogous to those in the discrete-time case [31]. We then relate our condition to a criterion 
based on the generator of the process. This criterion is the natural analogue of the Foster- 
Lyapunov condition for the geometric case; it also provides a single drift condition that generates 
the set of nested drift conditions by Fort-Roberts [11] for the polynomial case. Furthermore, it 
is analogous to the discrete-time version recently proposed by Douc-Fort-Moulines-Soulier [4]. 
In the literature, one approach for the theory of continuous-time Markov process is through the 
use of associated discrete-time chains : the resolvent chains and/or a skeleton chain. We discuss 
how our condition is related to a subgeometric drift inequality for these discrete-time Markov 
chains. As a consequence, we state new limit theorems such as moderate deviations for integral 
of bounded functionals, thus weakening the conditions derived in Guillin-Wu [15, 33]. 

Our conditions are then successfully applied to various non trivial models: (a) we first con- 



2 



sider elliptic stochastic differential equations for which conditions on the drift function enable 
us to generalize results by Veretennikov [32], Ganidis-Roynette-Simonot [12] or Malyshkin [20] 
(see also Pardoux- Veretennikov [27] for a study of the regularity of the solution of the Poisson 
equation under this drift condition); (b) we then study a "cold" Langevin tempered diffusion 
when the invariant target distribution is subexponential in the tails. This model is particu- 
larly useful in Markov Chain Monte Carlo method. Different regime of ergodicity (polynomial, 
subexponential or exponential) depending on the coldness of the diffusion term are exhibited, 
the different regimes are then characterized by the invariant target distribution. This study 
generalizes the Fort-Roberts' results, which consider the case when the target density is polyno- 
mial in the tails [11]; (c) we also give a toy hypoelliptic example, namely a stochastic damping 
Hamiltonian system, in the case when it cannot be geometrically ergodic. This model is shown 
to be polynomially ergodic (see Wu [33] for the exponential case); (d) we finally consider a sim- 
ple compound Poisson-process driven Ornstein-Uhlenbeck process (relevant for recent studies in 
financial econometrics) with heavy tailed jump. It is shown to be subgeometrically ergodic. 

Our approach may be considered as a probabilistic one. There are another ways to get 
subexponential rates of convergence (in total variation norm, in L 2 or in entropy) such as 
those based on spectral techniques (as in [12]), or on functional inequalities (weak Poincare 
inequalities [29] or weak logarithmic Sobolev inequalities [1]). These results are however not 
easy to compare to ours and we postpone a comparative utilization of these approaches to 
further research. 

Let us finally present the organization of the paper. Section 2 recalls basic definitions on 
Markov processes. The main results are given in Section 3. All the proofs are postponed in 
appendix. Section 4 is devoted to the examples and Section 5 to a new moderate deviation 
principle. 



2 Definitions 

Let (Q,F,(Ft)t>0i(Xt)t>0i($'x)xex) be a Markov family on a locally compact and separable 
metric space X endowed with its Borel cr-field B(X) : (fi,.F) is a measurable space, (Xt)t>o is a 
Markov process with respect to the filtration (J~t)t>o and (resp. E x ) denotes the canonical 
probability (resp. expectation) associated to the Markov process with initial distribution the 
point mass at x. Throughout this paper, the process is assumed to be a time-homogeneous 
strong Markov process with cad-lag paths, and we denote by (Pt)t>o the associated transition 
function on (X, 23 (X)). 

Let Ao denote the class of the measurable and nondecreasing functions r : [0, +oo) — ► [2, +00) 
such that log r(t)/t j as t — > +00. Let A denote the class of positive measurable functions f, 
such that for some r E Ao, 

r r(t) f(t) 
< liminf — — < hmsup —-r < °o- 
t r{t) ~ t r(t) 
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A is the class of the subgeometric rate functions and examples of functions r £ A are 

f(t) =t a (\ogtf exp(jt s ) 

for < 5 < 1 and either 7 > 0, or 7 = and a > 0, or 7 = a = and (3 > 0. We are ultimately 
interested in conditions implying that for all x £ X 

lirri r(i) ||P*(x, •) - 7r(-)||/ = 0, (2.1) 

t — >+oo 

where r £ A, ir is the (unique) invariant distribution of the process i.e. ixP 1 = ir for all 
t > 0, and for a signed measure ^i, ||^||/ = sup| 9 |<j |/u(c;)| where / : X — > [1, 00) is a measurable 
function. When / is the constant function 1 (l(t) = 1, t > 0), the /-norm is nothing more than 
the total variation norm. 

To that goal, we will need different notions of regularity and stability of continuous-time Markov 
processes and we briefly recall some basic definitions. The process is ^-irreducible for some 
cr-finite measure c/> on B(X) if (j)(A) > implies K x [J °° 1^(X S ) cis] > for all x £ X. A 
^-irreducible process possesses a maximal irreducibility measure ip such that (ft is absolutely 
continuous with respect to ip for any other irreducibility measure (ft [25] . Maximal irreducibility 
measures are not unique and are equivalent. A set A £ /3(X) such that VK^) > for some 
maximal irreducibility measure is said accessible; and full if i(j(A c ) = 0. A measurable set C is 
z^a-petite (or simply petite) if there exist a probability measure a on the Borel u-field of [0, +00) 
and a non-trivial cr-finite measure v a on B(X) such that 

r+00 

VxeC, / P\x,-) a(dt) > v a (-). 

Jo 

For a ^-irreducible process, an accessible closed petite set always exists [21]. A process is Harris- 
recurrent if, for some cr-finite measure [i, fi(A) > implies that the event {/ °° lA{X s )ds = 00} 
holds P-j-a.s. for all a; £ X. Harris-recurrence trivially implies (^-irreducibility. A Harris-recurrent 
right process possesses an invariant measure 7r [13]; if ir is an invariant probability distribution, 
the process is called positive Harris-recurrent. A c/>-irreducible process is aperiodic if there exists 
an accessible z^ m -petite set C and io such that for all x £ C, t > to, P t (x, C) > 0. A suffi- 
cient condition for a positive Harris-recurrent process to be aperiodic is the existence of some 
c/>-irreducible skeleton chain [23]; recall that a skeleton P m (m > 0) is said c/>-irreducible if there 
exists a cr-finite measure n such that [i(A) > implies Vx £ X, 3k £ N, P km (x,A) > [22]. 
A ^-irreducible and aperiodic Markov process that verifies (2.1) is said /-ergodic at a subge- 
ometric rate (or simply /-ergodic when r = 1). When r is of the form r{t) = k 1 for some 
K > 1, the process is said /-ergodic at a geometric rate. In the literature, criteria for the 
stability of Markov processes, when stability is couched in terms of Harris-recurrence, positive 
Harris-recurrence, /-ergodicity, with in this latter case, a mention of the rate of convergence, 
are expressed in terms of hitting-times of some closed petite set. For any 5 > and any closed 
set C £ B(X), let 

T C (5) = mf{t>5,X t £C}, 
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be the hitting-time on C delayed by 5 and define its (/, r)-modulated moment 



G c (x,f,r;8)=E x 



r(s) f(X s ) ds 



where / : X — > [1, oo) is a measurable function and r : [0,+oo) — > (0,+oo) is a rate function. 
When / = 1 (resp. r = 1), this moment is simply called the r-modulated (resp. /-modulated) 
moment. Following discrete-time usage [22, 31, 17], we call a measurable set C (/, r)-regular if 

supG B (x, f,r;5) < oo, 

for all 5 > and all accessible set B. Criteria for Harris-recurrence and positive Harris-recurrence 
can be found in [21, Theorems 1.1 and 1.2]; ergodicity and /-ergodicity are addressed in [23, 
Theorems 6.1 and 7.2]; criteria for geometric /-ergodicity at a geometric rate (resp. at a subge- 
ometric rate) are provided by [6, Theorem 7.4] (resp. [11, Theorem 1]). A short review of these 
notions and results can be found in [11]. 

In many applications, these moments can not be explicitly calculated; a second set of criteria 
based on the extended generator were thus derived for some of the stability properties above. 
We postpone to Section 3.4 a review of the existing conditions. 



3 Main results 



Let us consider the following drift condition towards a closed petite set C. 



D(C, V, (j), b): There exist a closed petite set C, a cad-lag function V : X — > [l,oo), an 
increasing differentiable concave positive function (f> : [1, oo) — > (0,oo) and a constant 
b < oo such that for any s > 0, x G X, 



E x [V(X s )]+E a 



(j) o V(X u )du 



< V{x) + bE x 



lc(X u )du 



(3.1) 



Note that (3.1) is equivalent to the condition that the functional 

s ^ V(X S ) - V(X ) + f <j> o V{X u )du - b [ l c {X u )du 

Jo Jo 

is, for all x £ X, a P x -supermartingale with respect to the filtration {J-t)t>o- 

The main result of Section 3.1 is Theorem 3.1 that states that this drift condition allows 
the calculation of an upper bound for some r-modulated moment where r G A, and for some 
/-modulated moment, / > 1. Using interpolating inequalities, we obtain (/, r)-modulated 
moments for a wide family of pairs (f,r). Section 3.2 is devoted to (/, r)-regularity : the main 
result of this section is Proposition 3.7 that identifies (/, r)-regular sets from the condition 
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D(C, V, (j>, b). In Section 3.3, we show that the drift condition D(C, V, (f>, b) provides a simple 
sufficient condition for an aperiodic strong Markov process to be /-ergodic at a subgeometric 
rate : the main result is Theorem 3.10 that builds on the work by Fort and Roberts [11]. We 
provide in Section 3.4 a condition couched in terms of the extended generators that implies the 
drift inequality D(C, V, eft, b). This condition generalizes the condition in [11, Proposition 6] 
that restricts to the polynomial case, and reveals of great interest in many applications. We 
present in Section 3.5 the interplay between a drift condition on the resolvent kernel and the 
drift condition D(C,V,i^, b). 
All the proofs are given in Appendix A. 



3.1 Modulated moments 



We show that D(C, V,0, b) is a simple condition that allows the control of /-modulated mo- 
ments and r-modulated moments, for a general rate function r £ A, of the delayed hitting-time 
t c {6). Let 




u > 1 . 



Theorem 3.1. Assume D(C, V, (f>, b) . 



i) For all x 6 X and 5 > 0, 



E, 

ii) For all x 6 X and 5 > 0, 

-to (a) 



rc{5) 



V{X S ) ds 



< V(x) - 1 + b5 . 



oH7 l (s) ds 



H-\s)ds . 



The proof of Theorem 3.1 does not require C to be petite. Nevertheless, this petiteness 
property will be crucial in all the following results: we will see that this assumption allows the 
extension of the above controls to those of modulated moments tb(5) for any accessible set 
B. Theorem 3.1 gives the largest /-modulated and r-modulated moments of tq (S) that can 
be deduced from D(C, V, 0, b). Interpolated (/, ry)-modulated moments of tc{5) can easily be 
obtained for a wide family of functions 1 < / < /* (and, equivalently, a wide family of rate 
functions r{s) < r*(s)) where 

U = 4>oV, u(s) = <poH-\s). (3.2) 

To that goal, we follow the same lines as in [4] and [11] and introduce the pairs of Young's 
functions (H±,H2) that, by definition, satisfy the property 

xy<H 1 {x) + H 2 (y), Vx,?/>0, (3.3) 



G 



and are invertible (see e.g [19, Chapter 1]). Let X be the pairs of inverse Young's functions 
augmented with the pairs (Id, 1) and (l,Id). Examples of pairs (H\,H2) are given in [4] and 
[11] while a general construction can be found in [19, Chapter 1]. Corollary 3.2 trivially results 
from Theorem 3.1 and Eq. (3.3). 

Corollary 3.2. Assume D(C,V, <f>, b). For any pairs (^1,^2) £ X and all 5 > 0, 



E, 



J *i (r„(s)) * 2 (MX,)) ds < 2(V{x) - 1 ) + b J Q ( 1 + J ds 



For two pairs \t> 2 ) and (^' l5 *' 2 ) in J, if ^i(x) < ^[(x) for all large x, then ^2(2/) > ^(y) 
for all large y [19, Theorem 1.2.1]. This shows that the rate *$>i (r*(-)) and the function ^2 (/*(")) 
have to be balanced : the maximal rate function r* is associated to the function / with minimal 
growth in the range 1 < / < /*, that is with / = 1; and the function with the largest rapidity 
of growth / = /* is associated to the minimal rate r = 1. 

Theorem 3.1 and Corollary 3.2 thus provides a control of (/, r)-modulated moments; a simple 
condition for the rate r to be in the set A of the subgeometric rate functions is recalled in the 
following lemma [4, Lemmas 2.3 and 2.7] 

Lemma 3.3. 7/linioo f = 0, r, £ A and for all inverse Young function ^1, $ior, G A. 
Proposition 3.4. Assume D(C, V, <f), b) . Then the process is tp -irreducible. If sup c V < 00, 

(i) the level sets {V < n} are petite and the union of these level sets is full. 

(ii) there exists a closed accessible petite set B such that D(B, V, eft, b) holds and sup B V < 00. 

As a consequence, when D(C,V, <p, b) holds and sup^ V < 00, we can assume without loss 
of generality that C is accessible. 



3.2 (/, r)-regularity 

Corollary 3.2 shows that the drift condition D(C, V, (j>, b) allows the control of modulated mo- 
ments Gc(x, f, r; 5), for all 5 > and a large family of pairs (/, r). Similar modulated moments 
relative to any accessible set B can be controlled provided sup xeC Gc(x, f,r; 5) < 00 for some 
5 > (and thus any 5 > 0, as established in [11, Lemma 20]). This naturally yields the notion of 
(/, r)-regular sets. The objective of this section is to identify regular sets from the drift condition 
D(C,V,^b). 

We start with a proposition that shows that the "self-regularity" of a closed petite set C ac- 
tually implies (/, r) -regularity. This results extends [21, Proposition 4.1] (resp. [11, Proposition 
22]) that addresses the case r = 1 (resp. / = 1). It also generalizes [11, Proposition 23] which 
concerns the case r = *i(r*) and / = ^(Z*) for some pair (^1,^2) £ 1- This proposition 
is the counterpart in the sub exponential setting of the result by Down-Meyn-Tweedie for the 
exponential case [6, Theorem 7.2]. 



7 



Proposition 3.5. Let f : X — ► [1, oo) be a measurable function and r G A be a subgeometric 
rate function. Assume that the process is ip -irreducible and sup xgC - Gc(x, f, r; 5) < oo for some 
(and thus any) 5 > and some closed petite set C. 



For all accessible set B G £>(X) and all t > 0, there exists a constant cs,t < oo such that for all 
x G X, 



Hence C is (/, r) -regular. 

Proposition 3.6. Assume that D(C,V, b) holds with C,V,<f> such that sup c V < oo and 
lim +oc <f> = 0. Then for any pair (v&i, ^2) £ 2T> any accessible set B and all 5 > 0, i/iere exists 
a finite constant c such that 



Hence, any V -level set {x € X, V(x) < v} is (/, r) -regular for all pairs (/, r) = (^2 f*, ^1 ?"*) 
with (^1,^2) G 2T- 

Proof. By Corollary 3.2, sup x . gC < Gc(x, f, r; 5) < 00 for all 5 > provided the drift condition 
D(C, V, (p, b) holds and sup^ V < 00. Finally, r = $1 o r t for some inverse Young function 
is a subgeometric rate if lim +00 eft' = 0. Proposition 3.5 thus implies that the level sets of V are 
(/, r)-regular sets. □ 

We now establish a general result that extends to continuous-time Markov processes, part 
of [31, Theorem 2.1] relative to discrete-time Markov chain. In the case r = 1, some of these 
equivalences are proved in [21] for continuous-time strong Markov processes. 

Proposition 3.7. Let f : X — ► [1, 00) be a measurable function and r G A be a subgeometric 
rate function. Assume that the process is ip -irreducible. The following conditions are equivalent 

i) There exist a closed petite set C and 5 > such that sup c Gc(x, f,r;S) < 00. 

ii) There exists a (f,r)-regular closed set which is accessible. 

Hi) There exists a full set Sq, which is the union of a countable number of (f,r)-regular sets. 

Theorem 3.1 proves that these equivalent conditions are verified provided D(C,V,(/>, b) 
holds, sup c V < 00 and lim^ <fr' = 0. 

We conclude this section by establishing that under mild additional conditions, the drift con- 
dition D also yields controls of modulated moments for the skeleton chains. For all m > 0, let 
Tm,c be the return-time to C of the skeleton chain P m , 



G B (x, f, r; t) < c B ,t G c (x, f, r; 5). 



(3.4) 




*2 (/*(**)) ds <cV{x). 



T m ,c = inf{fc > l,X mk G C}. 
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Proposition 3.8. Assume that D(C, V, (f>, b) holds with sup^ V < oo, and some skeleton chain 
is irreducible. For all m > and any accessible set B, there exist constants Q, 1 < i < 4, such 
that for all x G X, 



Ex 



J2 <f>°v(x. 



k=0 



and 



Va; E X, 



mk / 



Trn.B — 1 



<ci E a 



^ o v(x, m ) <is 



y~] r*(km) 



k=0 



< c 3 E a 



r*(s) (is 



< c 2 ^(x) 



< c 4 V(x). 



We will see in the last section that this proposition which clearly links the behavior of 
the skeleton chain to that of the initial process leads to new limit theorems such as moderate 
deviations. It will also imply interesting applications to averaging principle. 



3.3 /-ergodicity at a subgeometric rate 

From the control of i h Gc(x, f,r;5) where C is a closed petite set, we are able to deduce 
results on the ergodic behavior of the strong Markov process. 

The first result concerns the existence of an invariant probability distribution tt and shows 
that the drift condition D(C,V,(^>, b) provides a simple tool when identifying the set of the 
7r-integrable functions. The second one states that the Markov process converges in /-norm to 
the invariant probability measure tt, for a wide family of functions 1 < / < /* and a wide family 
of rate functions rj < r*. 

Proposition 3.9. Assume D(C, V, 4>, b) and sup c V < oo. Then the process is positive Harris- 
recurrent with an invariant probability measure tt such that Tx{(j)o V) < oo. 

Proposition 3.9 results from [21, Theorems 1.1 and 1.2] and Theorem 3. 1 (i) . It is known that 
positive Harris-recurrence does not necessarily imply ergodicity and aperiodicity is required [23, 
Proposition 6.1]; similar conditions are required in the discrete-time case [22]. In the present 
case, we have more information than positive Harris-recurrence and thus, we are able to establish 
/-ergodicity at a subgeometric rate. 

For a sequence r G A, define r°(t) = r(s) ds, and, for a differentiable rate function r, set 
dr(t) = *g. 

Theorem 3.10. Assume that 

(i) some skeleton chain is irreducible. 

(ii) the condition D(C, V, cj), b) holds with C, V, (f) such that sup c V < oo and lim +00 (j)' = 0. 
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For any pair = (vl/i, ^2) £ X and any probability distribution A satisfying X(V) < 00, 

lim {*i(r»(t))Vl} / \(dx)\\P\x,-)-Tr(-)U 2{Myl =0, (3.5) 

where r* and /* are given 6w and X is defined in Section 3.1. Furthermore, there exist 

finite constants &2 s i such that for all t > and all igX, 

(Mr*(t)) V 1} llP^O-vrOll^^vi^C*,!^), (3-6) 

/■oo 

/ {*i(r,(t))Vl} ||P*(x,-)- J P*(y,-)ll* 2 (/,)vi^<^,2{^) + ^(y)}; (3.7) 
J 

and i/9[^ r i(r*)] 6 A, i/iere exists a finite constant C^^ such that for all t > 0, 

;>oo 

/ {5[*i(r,)](t) V 1} \\P\x, •) - 7r(0lk 2(/ ,)vi * < C %3 V(x). (3.8) 

JO 

The limit (3.5) is a direct application of [11, Theorem 1] while (3.6) to (3.8) are, to our best 
knowledge, new results. The proof of this theorem is detailed in Appendix A. 
As already commented in [11], Eq. (3.5) shows that the rate of convergence and the norm in 
which convergence occurs have to be balanced : if ^1 strongly increases at infinity then ^2 
slowly increases (see [19] and the comments in Section 3.1). Hence, the stronger the norm, the 
weaker the rate and conversely. The maximal rate of convergence is achieved with the total 
variation norm (^2 /* = 1) an d the minimal one (^2 ° r* = 1) is achieved with the /*-norm. 
Hence, the drift condition D(C,V, </>, b) directly provides two major informations: the largest 
rate of convergence r* = (j) o H^ 1 is given by the concave function 4> and the largest norm of 
convergence || • is given by the pair (cj), V). 

Eqs. (3.6) to (3.8) are, to our best knowledge, the first results that address the dependence upon 
the initial point in the ergodic behavior. When applied to discrete-time Markov chains, (3.6) to 
(3.8) coincide with resp. [31, Theorems 2.1, 4.1, 4.2] (the dependence upon x can be read from 
the proof of these theorems; the details are also provided in [9, Chapter 3]). These results for 
the discrete-time case and the definition of the set in [11, Theorem 1] suggest that in (3.6), 
the minimal dependence in the starting value x is of the form Gc(x,^2(f*),^i{f Jf );5). Similar 
expressions can be predicted for (3.7) and (3.8). The proof of this assertion and the explicit 
construction of the constants C^j in terms of the quantities appearing in the assumptions are 
beyond the scope of this paper. Currently in progress is work on explicit control of subgeometric 
ergodicity for strong Markov processes. 

In the examples given in Section 4, we will see that the pair (4>,V) that solves D(C,V,</>, b) 
is not unique. Roughly speaking, we read from Theorem 3.10 that <j) is related to the rate of 
convergence in total variation norm, while V is the dependence upon the initial point in the 
control of convergence. As a consequence, the rate of convergence rAt) and the dependence 
V{x) can be balanced to make the bounds (3.6) to (3.8) minimal. In Section 4, we will give 
some examples (on X = R n ), where both a pair of polynomially increasing functions and a pair 
of subgeometrically increasing functions can be found. One then immediately remarks that the 
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stronger the control in the initial point is, the stronger the decay in time is for a given norm. It 
stresses once again the interest for exact constant in our controls to decide which "ergodicity" 
to use to reach a certain level. The fact that the pair (0, V) is not unique shows that the 
drift condition only provides an upper bound of the true rate of convergence. Nevertheless, in 
many applications, we are able to prove that the true rate belongs to the exhibited class of rate 
functions (see for example, section 4.2). 

3.4 Generator and drift inequality (3.1) 

The drift condition D(C, V, <f>, b) may not be easy to derive since it is couched in terms of the 
process itself. The main goal of this section is to provide an easier path to ensure subgeometric 
ergodicity, which is moreover the usual form of conditions adopted on earlier paper to address 
different classes of stability. Namely we will use the formalism of the extended generator [3, Def. 
1.15.15]. 

Let T>(A) denote the set of measurable functions / : X — > R with the following property: 
there exists a measurable function h : X — > R such that the function t \— > h(Xt) is integrablc 
P-z-a.s. for each i£X and the process 

t ^ f(X t ) - f(X ) - [ h(X s )ds (3.9) 
J o 

is a Pjr-local martingale for all x. Then we write h = Af, and / is said in the domain of the 
extended generator (A,T>(A)) of the process X. The condition (3.1) looks like a Dynkin formula. 
This is the reason why we want it to hold as widely as possible, thus justifying the interest in 
the extended generator concept. 

Theorem 3.11. Assume that there exist a closed petite set C , a cad-lag function V : X — ► [1, oo) 
with V E T>(A), an increasing differentiate concave positive function 4> ■ [1, oo) — ► (0, oo) and a 
constant b < oo such that for all x E X, 

AV(x) <-cf)oV(x) + blc(x). (3.10) 

Then D(C, V,0,b) holds. 

The proof is in Section A. 7. The extended generator is less restrictive than the infinitesimal 
generator A : if / is in the domain of A, then the process (3.9) is a martingale and / is in 
the domain of A (see e.g. [3, Proposition 1.14.13]). In particular, it is often quite difficult to 
characterize the domain of A but there may be (and are, in the applications of Section 4) easily 
checked sufficient conditions for membership of T>(A). 

This drift condition naturally inserts in the existing literature, that addresses criteria for non- 
explosivity, recurrence, polynomial ergodicity, geometric and uniform ergodicity. More precisely, 
Meyn and Tweedie provide conditions for non-explosion, recurrence, positive-Harris recurrence 
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and ^/-ergodicity at a subgeometric rate, respectively of the form 



AV(x) < cV(x), 

AV{x) < cl c (x), 

AV{x) < -cf(x) + bl c (x), 

AV(x) < -cV(x) + bl c (x) 



(3.11) 
(3.12) 
(3.13) 
(3.14) 



for some positive constants 6, c < oo and a measurable function / > 1 (see [24, Conditions (CDO) 
to (CD3)]; see also [6] for the condition (3.14)). These criteria are similar to some conditions 
provided by [16] for the stability of stochastic differential equations. The drift inequality (3.14) 
is the limit of our approach, since it corresponds to (3.10) with 4>(v) oc v. 

In a recent work, Fort and Roberts [11] considered a family of drift condition that implies /- 
ergodicity at a polynomial rate : namely, there exist 0<a<l,6>0 such that for all a < rj < 1, 
there exists > such that 



Our drift condition (3.10) with 4>(v) oc yields the same results as those provided in [11, 

Theorem 1, Lemma 25, Proposition 26] (see Theorem 3.10 and Proposition 3.1). Hence, the drift 
inequality (3.10) that addresses subgeometric ergodicity generalizes the criterion for polynomial 
ergodicity proposed by Fort-Roberts. The comparison of the Fort-Roberts nested drift conditions 
(3.15) and our single drift condition can be more explicit when V S *D{A) and the process (3.9) 
is a P^-martingale for all x. In that case, it is easily seen that the single drift condition implies 
the nested drift conditions. The martingale property is equivalent to 



is a P^-martingale for all x, where H(lnV) = V 1 AV [8]. Furthermore, H(ln V) < —g + s if 
and only if 



is a P x -supermartingale for all x [8]. As a consequence, if V v is in the domain of A for all 
< T] < 1 then the Jensen's inequality yields H(r] In V) < r/ exp(-alnV) + 6r/exp(— lnl/)lc 
which in turn implies (3.15). 

3.5 Resolvent and drift inequality (3.1) 

One of the approaches for studying the stability and ergodic theory of continuous time Markov 
processes consists in making use of the associated discrete time resolvent chains. This allows 
to take profit of the analysis of discrete time Markov chains which is quite well understood 
([25, 22]) and then to transfer properties established in terms of the resolvent or "generalised 
resolvent" kernel (see for e.g. [21]) to the Markov process itself. Following the discussion (done 



AV>(x) < -CrjV^ix) + bl c {x). 



(3.15) 





12 



for exponentially ergodic Markov process) by Down-Meyn-Tweedie [6, Th.5.1] and extending it 
to the subgeometric case, we will now link the drift condition D(C, V, (j), b, (3) associated to the 
Markov process to a drift condition associated to the discrete time resolvent chain. 

More precisely, define, for (3 > 0, the resolvent kernel Rp by Rp(x,A) = J °° (3e~^ 1 P t (x, A)dt 
and consider the following drift condition associated to the resolvent kernel. 

D(C, V, (f>, b, [3): There exist a petite set C, a function V : X — » [l,oo), an increasing 
differentiable concave positive function <j) : [1, oo) — ► (0, oo) and a constant b < oo such 
that for any x E X, 

RpV{x) < V(x) -(f>o V(x) + bl c (x). (3.16) 

The following result ensures that drift conditions expressed in terms of the resolvent kernel 
or of the Markov process are essentially equivalent. This theorem parallels Theorem 5.1. by 
Down-Meyn-Tweedie [6] for exponentially ergodic Markov processes. 

Theorem 3.12. (i) Assume D(C,V, (j), b,/3) where C is a closed set and RpV is a cad-lag 
function. Then D(C, R^V, /3(j> } /3b) holds. 

(ii) Assume D(C,V, (f), b) with sup c V < oo. Then, for all e > 0, there exists a constant c 
such that for all x £ X, 

n(t) 

W(x) < (1 + e)V(x) + c and lim — — = 1 + e 

t^oo r ((l + e)t) 

such that D(C, W, <f>, b, /?) holds. 
The proof is given in Section A. 8. 

4 Examples 

In this section, X = W 1 . Vectors are intended as column vectors, \x\ and (•, •) denote respectively 

the Euclidean norm and the scalar product. For a matrix a, \a\ = j a ij j i Tr(a) stands 
for the trace of the matrix and a' the matrix transpose. Id n is the nx n identity matrix. If V is 
a twice continuously differentiable function with respect to x £ W 1 , dV (or d x V when confusion 
is possible) denotes its gradient, and d 2 V its Hessian. 
For a set A, A c is its complement in W 1 . 

Four applications are considered: we first analyze general elliptic diffusions on M n such that 
the drift coefficient verifies a contraction condition of the form (b(x),x) < — r|x| 1_p for all large x, 
where < p < 1. We then consider a Langevin diffusion on W 1 having an invariant distribution 
which is super-exponential in the tails, and show that the rate of convergence can be modified 
by "heating" the diffusion. The method is however not limited to elliptic diffusions but can also 
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be of use in the hypoelliptic case: we consider as an illustration a simple stochastic damping 
Hamiltonian system which cannot be exponentially ergodic but is shown to be sub exponentially 
ergodic. We finally study a compound Poisson-process driven Ornstein-Uhlenbeck process when 
the distribution of the jump is heavy tailed. 

Queuing theory is another important field of application for our theory. We do not discuss 
here this field of applications. This will be done in a forthcoming paper, which will also include 
a comparison of our results to those by [2, 34]. Techniques in Dai-Meyn [2] differ from ours 
since they are based on fluid limits. Concerning [34], our conditions are more general; indeed 
the authors assume that there exists a state xq such that whenever the Markov process hits xq, 
it will sojourn there for a random time that is positive with probability 1, [34, Assumption 1.1]. 
This assumption makes their results unavailable for the applications we now consider. 

4.1 Elliptic diffusions on W 1 

Consider the stochastic integral equation of the form 

X t = X + f b{X s )ds + f a(X s )dB s , (4.1) 
J o J o 

where X t G W 1 , b : W 1 —> W 1 and a : R n —> R nxn are measurable functions, and {B t } t is a 
n-dimensional Brownian motion. Assume that b : R n — > M. n and a : M n — > R nxn are functions 
satisfying 

Al ex is bounded and b and a are locally Lipschitz : for any I > 0, there exists a finite constant 
q such that for all |x| < I, \y\ < I, 

\b(x) - b(y)\ + \a(x) - a(y)\ < q\x - y\. (4.2) 

Let a(x) = <j{x)o{x)' be the diffusion matrix. We assume that 

A2 (i) a(x) is non-singular : the smallest eigenvalue of the diffusion matrix a(x) is bounded 
away from zero in every bounded domain. 

(ii) there exist < p < 1, r>0 and M such that for all \x\ > M, (b(x), x) < — r|x| 1_p . 

Note that under Al, A = n _1 sup a , gR n Tr(a(x)) and A + = sup a; _ ! i (a(a;)|^| , ^} are finite. More- 
over, since under Al a is continuous, the assumption A2(i) is equivalent to the condition 
det(<r(x)) / for all x. 

Under Al, it is possible to define continuous functions bi and 07 that satisfy the at most 
linear increasing 

+ < 0,(1 + 1x1), VxGM", 

the Lipschitz condition (4.2) on the whole state space, and are such that bi = b and 07 = a on 
the cylinder {x S M. n , \x\ < I}. The stochastic equation (4.1) has a unique i-continuous solution 
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{X^}t, when b and a are replaced by bi and 07 [16, Theorem 3.3.2]. The first exit times of 
{X[ m) } t from {x £W l ,\x\ < 1} are identical for all m > I (and is thus denoted Q). This allows 
the construction of a process {Xt}t that satisfies (4.1) up to the explosion time £ = lim/^. If 
C = +oo a.s., {Xt}t is a.s. defined for all t > and the process is said regular. Under the stated 
assumptions, an easy to check sufficient condition for regularity relies on the operator L that 
acts on function V : W 1 , x i— » V{x) that are twice continuously differentiable with respect to x: 

LV{x) = (b{x), dV{x)) + ^Tr (a(x) d 2 V{x)) . (4.3) 

The process is regular if there exists a non-negative twice-continuously differentiable function 
V on W n such that for some finite c, LV < cV on W 1 and inf| x | >R V{x) — > oo as R — > oo [16, 
Theorem 3.4.1.]. Under A2(ii), it is trivial to verify that by setting V(x) = 1 + \x\ 2 , 

LV(x)<{ - 2r l x l 1_P + nA ' if \x\ > M, 

| 2M supi x i<j^ \b(x)\ + nA otherwise. 

This shows that the process is regular. Consequently, there exists a solution to (4.1), which is 
an almost surely continuous stochastic process and is unique up to equivalence. This solution is 
an homogeneous Markov process whose transition functions are Feller functions [16, Theorem 
3.4.1]. Hence, it is strongly Markovian, as a right-continuous Markov process with Feller tran- 
sition functions. We now discuss the existence of an irreducible skeleton P m and the petiteness 
property of the compact sets. All of these properties deduce from the existence of an unique 
invariant probability distribution tt. 

Proposition 4.1. Under A1-A2, X possesses an unique invariant probability measure tt. tt is 
a maximal irreducibility measure and any skeleton P m is irreducible. Furthermore, the compact 
sets are closed petite sets. 

Proof. By (4.4), [16, Theorem 3.7.1] and its corollary 2 [16, p. 99], there exists a bounded 
domain U with regular boundary and a finite constant c such that for all x E U c , K x [Tjjc] < oo 
and for any compact fC C W 1 , sup^g^E.,; [Tjjc] < oo, where 

Tu = mf{t>0,X t (jtU}. 

Since the diffusion matrix a(x) is non-singular, we deduce from [16, Theorem 4.4.1 and Corol- 
lary 2 p. 123] that the process possesses an unique invariant probability distribution tt. [16, 
Lemma 4.6.5] implies that any skeleton is (^-irreducible, with an irreducibility measure abso- 
lutely continuous with respect to the Lebesgue measure. By [16, Lemma 4.6.1], the support 
of tt has non-empty interior; since the process is ■0-irreducible and has the Feller property, all 
compact subsets of R n are petite (this assertion can be proved in exactly the same way as in 
the discrete-parameter case [22, Proposition 6.2.8]). □ 

Under Al-2, it si easily checked that any twice continuously differentiable function V : M n — > 
E is in the domain of A and LV(x) = AV(x) for all Observe indeed that t i— > LV(X t ) 
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is integrable P^-a.s. for all x S W 1 and t \— > V(-Xt) — V(Xo) — f LV(X s )ds is a right-continuous 
local martingale. Hence V £ T^{A) and LV = AV . We now establish drift inequalities for 
different test functions V. 

Proposition 4.2. Assume Al-2. Let V : W 1 — > [1,+co) be a twice continuously differentiable 
function such that V{x) = exp(t \x\ m ) outside a compact set, for some < m < 1 and i > 0. 
Then s\ip\ x \ <M AV(x) < oo and for all \x\ > M, 



(%) 7/0 < m < 1 - p, 

AV(x) < -t^mr [lny(x)] 1_ ^ j V(x) (1 + o(l)) ; 
(ii) If m = 1 — p, 

-4U(x) < -p){r- (l/2)A+t(l -p)} [lnV(x)]~ 2 ~V(x) (l + o(l)). 

Proof. Under the stated assumptions, sw£>i x \ x \<m}AV(x) < oo. By definition of A, we have for 
all |x| > M, 

AV(x) < -urn (r - (l/2)A+tm|x| p+m - 1 ) \x\ m - x - v V(x) + (l/2)tmnA\x\ m - 2 V{x). 

□ 

As a direct application of Proposition 3.9 and Theorem 3.1(h), we have 
Theorem 4.3. Assume Al-2. 

(i) For all l> suc/i £/ictt r - (l/2)A+i(l - p) > 0, 

y ir(dx) exp(i|x| 1_p ) < oo, 

where ir is the invariant probability distribution of the Markov process that solves (4-1) • 

(ii) There exists a closed petite set C such that for any < to < 1 — p, < ii < i-2 and 5 > 0, 
there exists a finite constant c such that 

m "I 

K x [exp(n {r c (5)}i+p)J < cexp(^\x\ m ). (4.5) 

If m = 1 — p, (4-5) still holds for any < i\ < £2 swc/i i/iat r — (1/2)^2 A+(l — p) > 0. 

The results of Theorem 4.3 can be compared to those by [20], where subexponential ergodicity 
in total variation norm of a diffusion satisfying the conditions Al-2 is addressed. The technique 
used in [20] is based on the coupling method. Theorem 4.3(i) states the same result as [20, 
Lemma 3]. Nevertheless, Theorem 4.3(h) yields a stronger control of delayed return-time to 



16 



a closed petite set than those obtained in [20, Theorem 5]. They show that for all < a < 
(1/2)(1 — p) there exists a constant c a such that 

E x [exp(T C (5) a )} < c a exp(|x| 2a ), 

and this remains valid for a = (1 — p)/2 if r — (1/2)A+(1 — p) > 0. Theorem 4.3(h) claims 
that for all < a < (1 -p)(l + p)' 1 and i > 1, E x [exp(Tc(<5) a )] < c Q exp(i and for 

a = (1 -p)(l E x [exp(iirc(5) a )] < c Q exp(i 2 for all < n < t 2 such that 

r- (1/2) t2 A+(l-p) >0. 

As a direct application of Theorem 3.10, we obtain the following results for /-ergodicity at a 
subgeometric rate. 

Theorem 4.4. Assume A 1-2 and let ir be the invariant probability distribution of the Markov 
process that solves (4-1)- Then the process is subgeometrically f-ergodic: for any x 6 M. n , the 
limits (3.5) to (3.8) hold with V(x) ~ exp(i|x| 1_p ) for some positive i such that r— 0.5A+t(l— p) > 
0, f,(x) ~ |x|- 2 Pexp(t|x| 1 -f) andr*{t) ~ t" 2 P/(i+p) exp({i't}( 1 -P)/( 1+ P)) where 

t ' = t i=F(i + p ) { r - (l/2)A+i(l - p)} . 

In [20], only the convergence in total variation norm of the semi-group {P t }t>o to the in- 
variant probability ir is addressed: is is established that the process is ergodic at the rate 
r^(t) oc exp((5^ 1 ~ p ^ 2 ) for some 5 > 0, and in that case, the dependence upon the initial point 
in (3.5) is V M (x) ~ exp(5|x| 1_p ). Theorem 4.4 improves these results and also provides rates of 
convergence in /-norm for unbounded functions /. 

We reported in Theorem 4.4 the values (V, /*, r*) that yield the best rate of convergence in total 
variation norm. Proposition 4.2 shows that one could establish the drift inequality (3.10) with 
V(x) ~ exp(i\x\ m ) for some < m < 1 — p; this would imply the limits (3.5) to (3.8) with 
V{x) ~ exp(t|x| m ), U(x) ~ |x| m - 1 - p exp(i|3;| m ) and r»(t) ~ t("»-i-f)/(i+p) exp(o'\x\ m ^ 1+p ^) for 
all < o' < i. We thus obtain a weaker maximal rate function r#, and a weaker maximal norm 
|| • ||/,, but this has to be balanced with the fact that the dependence upon the initial value 
(i.e. the quantity V(x)) is weaker too. Similarly, polynomially increasing controls V{x) could 
be considered, thus limiting the rate r* (resp. the function /*) to the class of the polynomially 
increasing rate functions (resp. to the class of the polynomially increasing function). These dis- 
cussions illustrate the fact that the pair (<j>,V) that solves (3.10) is not unique, and this results 
in balancing the pair (r*, /*) and the dependence upon the initial value x. 

4.2 Langevin tempered diffusions on W l 

Let 7r : M. n — > (0, oo) satisfying 

Bl -/r is, up to a normalizing constant, a positive and thrice continuously differentiable density 
on M n , with respect to the Lebesgue measure. 
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Let ct(x) = | hi7r(x)| rf for some d > and define the diffusion matrix by a(x) = <7 2 (x)Id n , and 
the drift vector by b(x) = (b±(x), ■ ■ ■ , b n (x))' where 

n n 

bi(x) = (1/2) aij{x) d Xj log^(x) + (1/2) ih«,jU). l<i<n. 

Observe that since ir is defined up to a normalizing constant, we can assume that cr(x) > for 
all x. Our objective is to study the ergodicity of the solution to the stochastic integral equation 

X t = X + [ b(X s )ds + [ a(X s )dB s (4.6) 
Jo Jo 

where {B{\t is a n-dimensional Brownian motion. This diffusion is the so-called Langevin 
diffusion and the drift vector b is defined in such a way that ir is, up to a multiplicative constant, 
the density of the unique invariant probability distribution. Note that this model is not a 
particular case of the elliptic diffusion of section 4.1 since here, a may be an unbounded function 
{a = | ln7r(x)| d ). 

Fort and Roberts investigate the behavior of these diffusions when tt is polynomially decreasing 
in the tails and address ergodicity in total variation norm and in /-norm as well [11]. They 
consider the case <j{x) = ir~ d {x) (d > 0) and show that the rate of convergence in total variation 
norm and in /-norm for f{x) ~ ir~ K (x) (k > 0) depends on d. When d is lower than some critical 
temperature d*, the process is ergodic at a polynomial rate, and when d is larger than d*, the 
process is uniformly ergodic in total variation norm and geometrically ergodic otherwise [11, 
Theorem 16]. Fort and Roberts thus proved that the rate of convergence can be improved by 
choosing a diffusion coefficient a which is small when the process is close to the modes of ir and 
big when it is far from the modes. The objective of this section is to investigate the case when 
tt is super-exponentially decreasing in the tails. We assume that 

B2 there exists < [3 < 1 such that for all large \x\, 

Ix) 1 "' 3 (ainvr(x),^) < 0, 
| a; | 

< liminf I6" In vr(x) 1 1 In vr(x) | < limsup |<9hi7r(x)|| In vr(2;)| 1//3_1 < oo, 

X >00 x — >oq 

limsupTr (d 2 ln7r(x)) |<91n7r(x)|~ 2 =0. 

x— >oo 

The class of density tt described by Bl-2 contains densities that are super-exponential in the 
tails. The Weibull distribution on (0, oo) with density n(x) oc x@~ l ex.p(—ax^) satisfies B2. For 
multidimensional examples, see e.g. [18, 28, 10]. Following the same steps as in Section 4.1, 
we can prove that under Bl-2 and provided the process is regular, there exists a solution to 
(4.6) which is an almost surely continuous stochastic process and is unique up to equivalence. 
This solution is an homogeneous strong Markov process whose transition functions are Feller 
functions. Under B2, the process is regular whatever d > 0; this can be proved as in the previous 
section (by choosing V = 1 + 7r~ 2 , see (4.7) below). 
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These assumptions also imply that tt is (up to a scaling factor) the density of an invariant 
distribution of the diffusion process, any skeleton chain is ^-irreducible and compact sets are 
closed petite sets ([11, Proposition 15]). 

Let V : W 1 — > [1, oo) be a twice-continuously differentiable function such that V(x) = l + 7r _K (a;) 
outside a compact set; then AV(x) = LV(x) = £i(x) + ^(^O where L is the diffusion operator 
(4.3) and for large \x\, 

and l2{x) = o(£i(x)). In [30, Theorem 3.1], it is established that the process is geometrically 
ergodic if and only if d > 1/(3 — 1. From (4.7), we are able to retrieve these results and we also 
prove that when < d < 1/(3 — 1, the process is subgeometrically ergodic. Observe indeed that 
for large (4.7) and B2 imply 

AV(x) < -c K [lnV(x)]~ a V(x), where a = 2(1/(3 - 1 - d), and c K > < k < 1. 

Hence, if a < 0, the process is ^-geometrically ergodic [21, Theorem 6.1] (see also section 3.4); 
if a > 0, it is subgeometrically ergodic as a consequence of Theorems 3.10 and 3.11. 
A polynomially increasing drift function can also be considered: we can assume without loss of 
generality that for large x, ln7r(x) < since n is defined up to a multiplicative constant. We 
thus set V(x) = 2 + sign(ft) (— ln-7r(x)) K outside a compact set. Then for large x, 

\~\t t ^ M i i / N^+i+2(d-i//3) ( |dm7r(z)| \ 2 
AV(x)<- Y (-\nn(x)) { \i n7r{x) \i-W ) + 

and there exists a constant c > such that for large x, 

AV{x)<-cV 1 ~ a {x), wherea = 2K- 1 (l//3-d-(l/2)). (4.8) 

First consider the case when k>0. If 1 / /3 — 1 < d < 1/ (3 — (1/2), the drift condition (4.8) and 
Theorems 3.10 and 3.11 yield polynomial ergodicity. For example, this implies convergence in 
total variation norm at the rate r(t) ~ i 1 ' 0-1 . If d = 1/(3 — (1/2), then a = and the process 
is geometrically ergodic. In the case when k can be set negative and 1 — a > i.e. when 
d > 1/(3 — (1/2), the process is uniformly ergodic: there exist A < 1 and a constant c < oo such 
that for all x, 

X- 1 ||P*(x,.)-7t(.)||tv<c, 
and the convergence does not depend on the starting point. 

The above discussions are summarized in the following theorem. The first part (resp. third part) 
results from [30, Theorem 3.1] (resp. [21, Theorem 6.1]). The second assertion is a consequence 
of Theorem 3.10. The last assertion was already proved by [30, Theorem 3.1] for one-dimensional 
diffusions (n = 1). 

Theorem 4.5. Consider the Langevin diffusion on R™ solution to the equation (4-6) where the 
target distribution tt satisfies Bl-2. 
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(i) If < d < 1//3 — 1, the process fails to be geometrically ergodic. 

(ii) If < d < 1/(3 — 1, the process is subgeometrically ergodic: the limits (3.5) to (3.8) hold 
with V(x) ~ ir- K (x), /*(f) ~ tt- k (x) |ln7r(x)r 2(1//3_1_d) and lnr*(i) ~ c^l^P-™® for 
all < k < 1. 

(Hi) If d > 1/(3—1, then for all < k < 1, i/ie diffusion is V -geometrically ergodic with 
V(x) = 1 + ii- K {x). 

(iv) If d > 1/(3 — (1/2), the diffusion is uniformly ergodic. 

This theorem extends earlier results to the multi-dimensional case and provides subgeomet- 
rical rates of convergence of the 'cold' Langevin diffusion, for a wide family of norms. We 
established that for a given 7r _K -norm, the minimal rate of convergence is achieved with d = 
and in that case, the rate coincides with the rate of convergence of the symmetric random-walk 
Hastings-Metropolis algorithm ([4, Theorem 3.1]). This rate can be improved by choosing a 
diffusion matrix which is heavy where ir is light and conversely. When d is larger than the 
critical value d* = 1/(3 — 1, the process is geometrically ergodic; when d is lower that d*, the 
process can not be geometrically ergodic and we prove that it is subgeometrically ergodic. The 
conclusions of Theorem 4.5 are similar to those of [11, Theorem 16], that address the case when 
7r is polynomial in the tails. 

We assumed that a = | ln-7r| d . A first extension is to consider a sufficiently smooth function a 
such that o~(x) ~ | ln-7r(x)| rf for large \x\; this yields similar conclusions and details are omitted. 
A second extension consists in the case when <r(x) ~ 7r~ d (x). In this latter case, following the 
same lines, it is easily verified that for small enough values of d, the process is regular (the set 
of the admissible values is in the range (0, 1/2]), and the process is ^-geometrically ergodic with 
a test function V(x) ~ ir~ K (x), k > 0. Details are omitted and left to the interested reader. 



4.3 Stochastic damping Hamiltonian system 



Both examples of the previous sections assumed that the diffusion process is elliptic. However 
the drift condition (3.10) enables us to consider also hypoelliptic diffusion that we will illustrate 
on the example of a simple stochastic damping Hamiltonian system, i.e. let xt (resp. yt) be the 
position (resp. the velocity) at time t of a physical system moving in M n 

dX t = Y t dt 

dY t = V(X u Y t )dBt - (c(X t ,Y t )Y t + d x U{X t ))dt 1 ^ 

where —d x U is some friction force, —c(x,y)y is the damping force and T,(x,y)dB is a random 
force where (Bt) is a standard Brownian motion in R n . This system has been studied from the 
large and moderate deviations point of view by Wu [33] where he also establishes the exponential 
ergodicity under various set of assumptions. 

As our goal is not to consider the model in its full generality but to illustrate the subexpo- 
nential behavior of hypoelliptic diffusion, via the simple use of drift condition (3.10), hereafter 
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we will consider the particular (but also current in practice) case where the damping and ran- 
dom forces are constant c(x,y) = c ld n and £(x,y) = a Id n , c and a being positive constants 
(as, if one is identically equal to 0, there is none of the usual ergodic properties such as positive 
recurrence). We will assume moreover that the potential U is lower bounded and continuously 
differentiable over M. n . In this case, the system is known to have an unique invariant measure 
given by 

n(dx,dy) = e~^ H( - x ' v Uxdy 

where H is the Hamiltonian given by H(x,y) = ^\y\ 2 + U(x). 

Let us first ensure the existence of solutions and aperiodicity for the process Zt = (Xt,Yt) 
via the following proposition due to Wu [33, Lemma 1.1, Proposition 1.2] 

Proposition 4.6. For every initial state z = (x,y) £ R 2n , the SDE (4-9) admits an unique 
weak solution P z which is non explosive. Moreover denoting (P t (z,dz'))t the associated semi 
group of transition, we have that for every t > and every z 6 ]R 2n , P t (z,dz') = pt{z, z')dz' 
and pt(z, z') > 0, dz' — a.e. The density pt(z, •) is moreover continuous, and the process is thus 
strongly Feller. 

As a consequence, the solution is a strong Markov process, all the skeletons are irreducible 
and compact sets are petite sets. 

Let us build an example of polynomially ergodic stochastic damping Hamiltonian system in 
dimension 1. We rewrite the system as 

dX t = Y t dt , 
dY t = adB t - (cY t + U'(X t ))dt, ^' iUJ 

and assume that U is C 2 , and there exist < p < 1 and positive constants a, b such that for \x\ 
large enough 

a\x\ p - 1 < U\x) < b\x\P- 1 . (4.11) 

The fact that p is less than 1 implies that (Zt)t>o cannot be exponentially ergodic [33, Theorem 
5.1]. We now exhibit a drift function satisfying (3.10). Consider positive constants a,f3 and a 
smooth positive function G such that for m, 1 — p < m < 1, G'(x) = \x\' m for large |x|; define a 
twice continuously differentiable function V > 1 such that for large x, y, 

V(x, y) = a(y 2 /2 + U{x)) + (3(G'(x)y + cG{x)). 

By definition of A, it holds 

AV m (x, y) = ^a 2 d 2 V m (x, y) + y d x V m (x, y) - (cy + U'(x)) d y V m (x, y) 



so that 



AV m (x, y) = l -aa 2 + y(aU\x) + (3G'\x)y + (5cG'{x)) - [cy + U'(x))(ay + 0G'(x)) 
= ^aa 2 + (pG"(x) - ca)y 2 - f3G' (x)U' (x). 
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Fix 5 < 0; since m < 1, we choose j3 small enough so that /3G"(x) — ca < 5 < for all large 
x. Furthermore, for all large G'(x)U'(x) > b\x\ p ~ 1+m . Hence, there exist positive constants 
K, L such that 

p — l-\-m 

AV m (x,y) <K-L V m (x,y)^+^. 

p — 1 + m -I , 

Condition (3.10) holds with 4> m (v) oc v m + 1 and p ~ m+ { n < 1. Application of the results of 
Section 3.3 now implies that the process (Zt)t>o is polynomially-ergodic. 

Let k > 1 and define a twice continuously differentiable function V m k > 1 such that for large 

x,y 

V m , k {x,y) = V£(x,y). 



Then for large x, y, the above calculations yield 



1 



AV m>k {x,y) = {AV m {x,y))V* 1 (x,y) + -<j 2 (d y V m {x,y)) 2 V^- 2 {x,y) 
< {K' - LV m {x,yf$r)V*-\x,v) 



< K" - L'Vr m+1 



71 + 1 

m 



for some positive constant K',K",L'. This inequality is once again the condition (3.10) with 
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fc(f) = iAm+i + ' . These discussions are summarized in the following Theorem. 



Theorem 4.7. Let U be a twice continuously differentiable function, lower bounded on R satis- 
fying (4-11) for some < p < 1. T/ien (Zt)t>o is not exponentially ergodic but is polynomially 
ergodic : for any m such that 1 — p < m < 1 and any k > 1, t/ie limits (3.5-3.8) hold with V m k 

defined above, 4> m ,k(v) oc - 1 , /* = ^> m ,fc o V^,^ and oc i 2 -p 

Observe that the process (Zt)t>o is polynomially ergodic at any order and we strongly believe 
it is sub exponentially ergodic. This sub exponential case is left to the interested reader. The 
multidimensional case is more intricate in the choice of the drift function and we do not pursue 
here in this direction. 

This example shows that our conditions are sufficiently flexible to consider the hypoelliptic 
diffusions as well as the elliptic ones. 



4.4 Compound Poisson-process driven Ornstein-Uhlenbeck process 

In this section we consider an example of Fort-Roberts [11] where subgeometric ergodicity can 
be achieved where they only obtain polynomial ergodicity. Let us first recall the model. Let X 
be an Ornstein-Uhlenbeck process driven by a finite rate subordinator: 

dX t = -tiX t + dZ t 
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and Zi = X/iSi Ui, where (£/j)i>i is a sequence of i.i.d.r.v. with probability measure F, and 
(Nt) is an independent Poisson process of rate A. We suppose the recall coefficient \i to be 
positive. Remarking that only when F is sufficiently (even extremely) heavy tailed, X fails to 
be exponentially ergodic, Fort-Roberts [11] give conditions for which X is polynomially ergodic. 
Namely, denote G the law of the log jump sizes (G(A) = F(e A )), and assume that for all k > 0, 
J e KX dG(x) = +00. Lemma 17 of Fort-Roberts then prove that X is not exponentially ergodic 
and give examples where X is positive recurrent and polynomially ergodic, namely when for 
some r > 1, J °°[log(l + u)] r F(du) is finite. Such assertion may be useful considering 

(7-1 

F(dx) = jz k . ... dx k > 1 

x(log(x)) k 

C ~l e -c{\og(x))P 

F(dx) = dx (3<1. 

x 

We shall strengthen their result by 

Proposition 4.8. Suppose that (Xt) is aperiodic and that for some 5 < 1, a > 

'■00 



/ e a ^ 1+x » S F{dx) < 00. 
J 



Then, the conclusions of Theorem 3.10 hold with V{x) = e a '( lo s x ) s (and a' < a if 5' = 5), and 
<f>{v) = v^- 5 'y & ', n(t) = at-W^e* 6 ' , f* = (f>oV. 

Proof. We shall use the drift conditions introduced previously for the generator defined by 
for all functions V in the extended domain of the generator 

/>oo 

AV{x) = A / (Y(x + u) - V{x))F{du) - fixV'(x). 



Choosing V(x) = (log(x)) r , as in Fort-Roberts [11, Lemma 18], for sufficiently large x ensures 
the polynomial ergodicity at the previous rate. Consider now V(x) = e a '^ x ^ , so that 

roc . t p a'(logx) s 

AV(x) = A / ( e °'( lo g(^+«)) 5 _ e *'(^r ) F ( du ) _ c'S'fjL ,^ gl 



(log x) 



< -a' 2 ' 6 ' uS'- V n + b 

(logV)^- 6 )/ 6 



recalling that for large x 



«i(\o g { x+ u)Y _ e a'(io g ^ „g,e g(l + u/x ); 

(logx) 1 ^ 6 



the dominated convergence theorem ends the argument. 
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5 Skeleton chain and moderate deviations 



We consider here an important field of application for this subgeometric rate, namely moderate 
deviations for bounded additive functionals of Markov process. In fact, Proposition 3.8 gives 
us more than a way to deal with subexponential ergodicity, it also implies a drift condition in 
the sense of Douc-Fort-Moulines-Soulier [4] which will enables us, at least in a bounded test 
function framework, to extend to the continuous time case some limit theorems tailored for the 
subexponential regime by Douc-Guillin-Moulines [5] such as moderate deviations. Moderate 
deviations are concerned with the asymptotic for centered g with respect to tt and for < t < T 
of 

t 



st = 7W)l 9{Xs,Ms 



where as e tends to 0, h(e) — > oo but y/eh(e) — > 0, namely a regime between the large deviations 
and the central limit theorem. We may then state (proofs will be done in appendix. ) 

Theorem 5.1. Assume that D(C, V, cj>, b) holds with sup c V < oo, and some skeleton chain is 
ip -irreducible. 

(i) For all m > 0, there exist a function W : X — > [0(1), oo), a small set C for the skeleton 
P m and a positive constant b' such that sup^ W is finite, and on X, 

P m W <W - cpoW + b'l d , and (poV<W<nV. 

(ii) Assume that Xq is distributed as fi and fi(V) < oo and that g is a bounded mapping from 
X to W n with n(g) = 0. Suppose moreover that for all positive a 



oc 



then (S e G •) satisfies a moderate deviation principle in Cb([0, l],R n ) (the space of con- 
tinuous functions from [0, 1] to W 1 starting from 0) equipped with the supremum norm 
topology, with speed j^r^j an d rate function Ig, i.e. for all Borel set A £ Co([0, 1],R") 

- mf /^7) < liminf-^ T logP M (5 e e^) 

< limsup— UlogP^Sf €A)<- mf 

where Ig is given by 



sup <j (j(t), - -o- 2 ((g, C» } dt if d 7 (t)=7(t)<ft, 7 (0)=0, 
o CeiR" 



(5.1) 



+oo else, 
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and 



( 



I 



■n 



) 



2 



lim — E, 



'7T 



g(X s )ds 



(5.2) 



2/(3,0/ P t (gX)dtdn 



x JO 



The proof is in Section A. 9. 

To the authors' knowledge, this moderate deviations result (even for bounded function) is 
the first one for Markov processes which are not exponentially ergodic. It extends then results 
of Guillin [15, Th 1.] or Wu [33, Th. 2.7] in the subexponential setting. As expected, all ranges 
of speed are not allowed for such a theorem but are limited by the ergodicity of the process (we 
refer to Douc-Guillin-Moulines [5, Sect. 4] for a complete discussion on this interplay). The 
extension of this moderate deviation principle to unbounded function is left for further research, 
as well as extension to inhomogeneous functional and averaging principle, those subjects needing 
particular tools and developments. 

A Proofs 

A.l Proof of Theorem 3.1 

Lemma A.l. For any M > and for any cad-lag function g, 



Proof. First note that g is bounded since it is a cad-lag function. Let ry > be an arbitrary real. 
For any x € [0, M], there exists an interval (x — a, x + a) such that 

Vs £ (x — a, x), \g(s) — g(x—)\ < rj/2 and Vs G [x,x + a), \g(s) — g(x)\ < r//2 

Thus, for any (u, v) in (x — a,x) x (x — a,x) or in [x,x + a) x [x,x + a), \g(u) — g(v)\ < n. 
By compacity of [0, M], there exists a finite number of such intervals (xj — «i,Xj + a^) 
which covers [0, M]. Taking e sufficiently small, it can be easily checked that any interval 
[e(k — 1), ek] is included in some interval (xj — a, Xj + a). Now, if some x» £ [e(k — 1), ek], write 
sup Uit , G [ e ( fc _i), e fc] \g(u) - g(v)\ < 2sup a , g[0jM] \g(x)\. Otherwise, we have sup U)V&[e(k _ l)y€k] \g(u) - 
g(v)\ < r/. Thus, since there is at most intervals [e(k — l),ek] which contain some Xj, 




(A.l) 




The proof follows by letting e — > and by noting that n is arbitrary. 



□ 
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Proof. (Theorem 3.1) Proof of (i) is a direct application of the optional sampling theorem for a 
right continuous super-martingale (see e.g. [7, Theorem 2.13 p. 61]) 



s ' ^ V(X S ) - V(X ) + f <P o V{X u )du - b f 1 C (X U 

Jo Jo 



)du, 



with the bounded ^"-stopping time r = tc(S) A M and by letting M — ► oo. We now prove (ii). 
Let = H7 1 (Hj,(u) +t)- HT 1 ^). Note that 



gg(Vu) _ cpoH^jH^ + t) _ <j>oH^-{H^u)+t) 
<t>(u) ~ ^oH-^H^u)) 



du 

dG(t,u) 
Ft 



(A.2) 
(A.3) 



By log-concavity of tpoH^ 1 , for any fixed t, u t— > 9G q^ is non increasing and thus, for any fixed 
t, the function u i— ► G(t, u) is concave. 

Let e > 0. Write = e/c and 

fsup{A; > 1; t fc _i < r c (5)} if r c (<5) < oo 
I oo otherwise. 



Note that by (i), Pa;( T c(^) < °°) = 1- It is straightforward that tc(S) < eiV e and that eiV e is a 
^-"-stopping time. This implies that for any M > 5, 



T C (S)AM 



Hl l (s)ds 



G(0,V(x)) < limsupE, 



lim sup E x 



(eiV £ )AM 





r e(N e AM £ ) 
J 4>oH^\s)ds 



H-\s)ds 



< limsupA(e) 



G(0,^(x)) 

-G(0,V(x)) 
(A.4) 



where 

M e := [M/e\, 

A(e) := E, [G?(e(JV e A M e ),V(X e{NeAMe) )) - G(0, V(x))] + E 
We now bound limsup^Q ^( e )- First, write for any e > 0, 
A(e) = E a 



e(iV e AM e ) 



H7 1 (s)ds 



< E, 



fc=i 

^E[G(t fc ,y(x ifc ))-G(t fc _ 1 ,y(x ifc _ 1 ))|^ fc _ 1 ]i TcW>tfe _ 1 



+ E X 









r-e(Af E AM e ) 



,fc=l 



+ E X 









fe(7V e AM e ) 



(A.5) | 
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where we have used that {tc(5) > ifc-i} £ 3 r t k _ 1 - Moreover, by concavity of u — ► G(t,u), 



E [G(t fc , F(X t J - G(tk-i, 



tfc-i 



St 



Replacing by the expressions of the partial derivatives and ^ given in (A. 2) and (A. 3) and 



inserting the resulting inequality in (A. 5) yields, combining with D(C, V, cj), b) 
™± ( f** 4>oV(X a )d.° 

ik=l V 

e(N c /\M e ) 



A(e) < E a 
b 



+ e l rc{5)>tfc _ 1 



+ 



1 



Consider the first term of the rhs. By Fatou's lemma, 

Jl k _ i( f>oV(X s )ds 



lim sup K x 



< E, 



\_M/e\ 

fe=i 



+ e 



VW SU P n^)) + M)limsup £ ^ 



0(1) 



by applying Lemma A.l with g(s) := <po V(X S ). Thus, using again Fatou's lemma, 

-tc(8)aM 



E, 



G(0,F(z)) 

re{N e AM e ) 



< limsup.A(e) < lim sup K x 

e^O 0(1) e^O 



H-\s + e)l c (X s )d S 



< 



II) 



E x 



1 



-E, 



s <Tc-((5)+e 
-5 



/ 0° ^H^CpQ ( limsupl s<eAre< 

f M 1 6 

jf <j>oH^\s)lc{X s )l s < To{5) ds = — J d>oH^\s)ds 



The proof follows by letting M —> oo. 



□ 



A. 2 Proof of Proposition 3.4 

The ^-irreducibility results from [21, Theorem 1.1]. Under the stated assumptions, there exists 
a finite constant b' such that RV{x) < V(x) + b' where R denotes the resolvent for the process 
R(x, dy) = J eyip(—t)P t (x, dy)dt. This shows that the set {V < oo} is absorbing for the 72-chain, 
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and since R is ^-irreducible, it is full or empty [22, Proposition 4.2.3]. Since C C {V < 00} , 
this set is full. 

Let B be a closed accessible petite set, the existence of which is proved in [21, Proposition 3.2(i)]. 
Since B is accessible, there exists to and 7 > such that inf xe c P^ {tb < to) > 7- Observe indeed 
that we can assume without loss of generality that C is z^ a -petite for some maximal irreducibility 
measure u a [21, Proposition 3.2]. Hence 

< v a {B) < ¥ x (X € G B) < F x (Xt G B,£< t )+F x (£ > t ) < F x (r B < to) + P(£ > t ) , 

where £ ~ a(dt) is independent of the process. Choose to such that P (£ > to) < 0.5v a (B) and 
the existence of 7 follows. In the proof of [21, Proposition 4.1], it is shown that for all 5 > 0, 
there exists a constant c < 00 such that for all i£X, 

Ex [r B ] < E^ [t c (8)} + c. 

Hence, by Theorem 3.1, there exists a constant c < 00 such that E^ [tb] < cV(i). This implies 
that the level sets B n = {V < n} are petite (see the proof of [21, Proposition 4.2]). 
Since {V < 00} is full, D n B n is full. This implies B n is accessible for n large enough, and 
C C B nf for some (and thus all) n* > sup c V. Finally, since v a is a regular measure, there 
exists a compact set B such that CCBC B nt and v a {B) > 0. This concludes the proof. 



A. 3 Proof of Proposition 3.5 



We can assume without loss of generality that r G Ao and we will do so. 
By [11, Lemma 20], there exists a constant k < 00 such that 

G c (x,f,r;t) <k^ G c (x,f,r;5). 



(A.6) 



Since sup c Gc(x, /, r; <5) < 00, that for all for all t > 0, Af t = sup c Gc(x, f, r; t) < 00. Let to 
be such that for some 7 > 0, ini x& c F x (tb < to) > 7 > (such constants always exist, see the 
proof of Proposition 3.4). 
Let T k be the fcth-iterate of r = re (to) 



T k-l +Tod T*-l 



k>2, 



where 6 is the usual shift operator. Define for n > 2, the {0, l}-valued random variables (u n ) n 
by u n = 1 iff tb o # rn < to- Then by definition, u n G JP" T n and P x (tt n = l^-n-i) > 7 > 0. 
Finally, set 77 = inf{n > 2, u n = 1}. Then it holds 



Gb(x, f, r; to) < E a 
Define for all n > 2, 



r(s)/(X s ) 



r r n 

<> L / r( S )/(X s )ds 

n>2 ^ 



a^ra) = E 2 



- r n-l 

/ r(s)f{X s ) ds l v > r 
Jo 



b x (n) = E x [r(r n - 1 ) 1 



r;>nj 
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Then by the strong Markov property and the property r(s + t) < r(s)r(t) for all s,t > 0, we 
have 

G B {x,f,r;t Q ) < ^ (a x (n) + M to b x (n)) . 

n>2 

Following the same lines as in the proof of [26, Lemma 3.1], it may be proved that for all n > 3 

b x (n) < p b x (n - 1) + c (1 - 7) n ~\ a x (n) < (1 - 7) a x (n - 1) + M b x (n - 1), 

for some positive constants c < 00 and p < 1. This proves that there exists a constant c < 00 
such that Gb{x, f,r;to) < c Gc(x, /, r;to). By (A. 6), there exists a constant q such that 
Gb(x, f,r;to) < q Gc(x, f,r; 5). This implies that sup^g^ Gb(x, f, r; to) < 00. Finally, for all 
n > 1 we write 



Gb(x, /,r;t + nt ) < E a 



r s (to)o0 T S (i o)+r5(to) 



r(s)/(X s ) 



< E T 



+ E X 

Tg(t ) 



r(s)f(X s ) ds 
r(rS(i )) E XTg(to) 
r(s)/(X s ) 



TB (to ) 



r(s) f(X s ) ds 







+ sup G B (*,./>; t ) E x [r(r3(to))] 

x&C 



Since / > 1 and lim t _ >00 r(t)/ f t r(s)ds = for all r G Ao, there exists a constant c < 00 such 
that for all n large enough 



G B (x, /, r; t + nt ) < cE x 



"5 (*o) 



r(s)/(A s ) ds 



As in the proof of [11, Lemma 20] (see also [21, Lemma 4.1] for a similar calculation), the term 
in the right hand side is upper bounded by c n t Gc(x, f, r; 5) and this concludes the proof. 



A. 4 Proof of Proposition 3.7 

We prove that (i) and (ii) are equivalent. That (it) implies (i) is trivial. For the converse 
implication, we start with proving that {x G X, Gc(x, f,r;5) < 00} is full. This can be done 
following the same lines as the proof of [21, Proposition 4.2] upon noting that (a) by [11, Lemma 
20], there exists M < 00 such that for all t > 0, G c (x, f,r;5 + t) < G c (x, f, r; 5) + M* ; (b) we 
can assume that C is f a -petite for some maximal irreducibility measure v a and a distribution a 
such that / M t a{dt) < 00 ([21, Proposition 3.2(h)]). Proposition 3.5 now implies that the sets 
C n = {x G X,Gc(x, f,r; 5) < n} are (/, r)-regular and thus petite ([21, Proposition 4.2(i)]). As 
in the proof of Proposition 3.4, we thus deduce that there exists a (/, r)-regular set, which is 
petite, closed and accessible. 

We have just proved that under (i), the sets C n are (/, r)-regular petite sets and U n C n is full. 
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This shows that (i) (Hi). 

We finally prove that (Hi) =>• (ii). Since ip(U n C n ) > 0, C n C C n+ \ and ip is regular, there exists 
and a compact set A such that A C C n „ and ^(^4) > 0. Hence, A is accessible; furthermore, 
it is (/, r) -regular (and thus petite) as a subset of a (/, r) -regular set. 



A. 5 Proof of Proposition 3.8 

(i) We first prove that 

]T ^o^(i rat ) 



E 



k=0 



< m~ l E 



m{T miB AM) 



{(f) o F(X S ) 







+ E x [m(T mtB AM)}. 

(A.7) 



where M is any positive real number. Write 
( T ^ M \ / rT m , B AM 

e x I E 4>oV{x mk )\ -^iyj o 



<p o V(X ms )ds 



= E 

/ oo 

<e* E 



fc=i 

oo r ^fc 



{</» o ypr mfc ) - <p o y(x ms )}d s 



lfc<T m . B AM 



{0' o V(X ms )(V(X mk ) - V(X ms ))}ds 



k-1 



< V / E x [Es (F(X mfc ) - V(X ms )| ^ mfl ) 0' o y(X ms )l fc < T?? 



; am ds 



< ty'(l)E a 



E / / !c(^u)^ 1 k<T rr 



3 AM 



b'(l)E x 



m(T mtB AM) 



l c (X u )du 



< b<t>'(l)E x [m{T m , B A M)] 
Finally, 



E, 



T m . B /\M 



V{X ms )ds 



m' 1 E x 



m{T m , B AM) 



V(X s )ds 



and (A.7) is established. The drift condition D(C, V,(/>, b) and the optional sampling theorem 
imply 



E, 



Combining (A.7) and (A. 8) yields 



V(X s )ds 



< V(x) + b E x [m{T m>B A M)\ . 



(Ai 



E, 



E </>°V{X, 



mk J 



k=0 



< m- l V(x) + c E x [T m , B A M] , 
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for some finite constant c. Since sup^ V < oo, by Proposition 3.4 and Theorem 3.1, there exist 
a closed accessible petite set A and for all 5 > 0, a finite constant c$ such that 



E x M<5)] < C tf V{x) 



SUp V < OO. 

A 



Furthermore, under the stated assumptions, the process is positive Harris-recurrent [21, Theorem 
1.2] and since some skeleton is irreducible, there exists a maximal irreducibility measure v and 
to > such that inf xe ^ infi>i P t (x, •) > u(-) ([23, Proposition 6.1] and [21, Proposition 3.2(ii)]). 
Hence, there exists 7 > such that inf xgy i mi to <t<t +m. Ex (Xt £ B) > 7. Following the same 
lines as in the proof of [11, Proposition 22(h)], it may be proved that E x [T m> #] < c'V(x) for 
some constant d < 00, thus concluding the proof. 
(ii) Since r* = <f> o H7 1 is increasing, 



E, 



y~] r*(km) 



fc=0 



r*(s)ds 



As in the previous case, we show that inf xey i inf <0 < t < to+m P x (X t £ B) > 7 > for some closed 
accessible petite set A. The result now follows from [11, Proposition 22(h)] (with a minor 
modification : the authors claim that T m ^B < t v while we have mT m ^ < t^) and Theorem 3.1. 



A.6 Proof of Theorem 3.10 

The theorem is a consequence of [11, Theorem 1] and of results by Tuominen and Tweedie [31] 
on discrete time Markov chains. We nevertheless have all the ingredients in this paper to rewrite 
the proof of [11, Theorem 1] in few lines. For ease of the proof of the new results, we start with 
this concise proof. 

Let P m be the irreducible skeleton. We can assume without loss of generality that or, £ Ao, 
$1 o r, > 1 and ^2 /* > 1, and we do so. Write t = km + u for some < u < m and a non- 
negative integer k. Since $ior t 6 Ao and is a non-decreasing rate function, o r*(km + u) < 
$1 o r*{km) $1 o r*(m). Furthermore, if \g\ < ^2 f*i upon noting that ^2 and 4> are non- 
decreasing concave functions 

P u \g\ < P u {^2 o <p o V) < ^2 o (P u y) < ^ 2 o (V + bm) < * 2 (/«) + m60'(l) < c * 2 (/*), 

where we used that by (3.1), P U V < V + fru. Hence, there exists a finite constant c such that 

*ior»(t) ||P*(x,-)-vr(-)|k 2 o/, <c*!or»H ||P fcm (x, •) - vr(-)|k 20 /,. (A.9) 

By Proposition 3.4, there exists a V-level set A = {V < re} which is accessible and petite for 
the process. Hence, under the stated assumptions, there exist to and a maximal irreducibility 
measure tp such that inf£>t ini x& A P t (x, •) > ^(-) ([23, Proposition 6.1] and [21, Proposition 
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3.2(ii)]). This implies that A is petite and accessible for the m-skeleton and P m is aperiodic. 
Furthermore, by Proposition 3.8 and the inequality (3.3), 



sup E x 

A 



T m> A — 1 

*ior»(j'm)* 2 o/,(X 

j=0 



< OO. 



(A.10) 



We now have all the ingredients to deduce (3.5) to (3.8) from known results on discrete-time 
Markov chains. Eq. (3.5) results from [31, Theorem 4.1, Eq(36)] while (3.6) is established in the 
proof of [31, Theorem 4.1]. (3.7) is a consequence of [31, Theorem 4.2]. Since d[*$>i o r*] £ Ao 
(and thus is non-decreasing), there exists a finite constant c such that for all < \u\ < m, 



i-km 

d[^i o r*](km + u) < cd\^*\ o r*](km — u) < cm" 1 / d[^f\ o r*](s) 

J km—m 

< cm _1 {[\I'i o r*](&m) — [^i o 7\„](&;m — m)} = cm _1 {A[ v I'i o r*](km)}, 

where for a rate function r defined on the non-negative integers, we associate a sequence Ar 
defined by Ar(0) = r(0) and Ar(fc) = r(fc) — r(fc — 1), k > 1. Thus, there exists c < oo such 
that 

o r*](i) ||P'(x, •) - 7r(-)||* a o/. < c A[#! o r.pro) ||P fcm (x, •) - 7r(-)||* a o/. - 

Under the stated assumptions, {Af^i or t ](ira)}(; is a subgeometric rate function defined on the 
integers (see e.g. the class A in [31]). Observe indeed that 



In 



9[f l0rt ]H < ln ( m 1 Ikm +1)md ^ or *^ s ) ds ) _ln(m- 1 A[$ 1 or t ](fcm + ra)) 



km 



km 



< 



km 

ln5[*i o r*]((k + l)m) — lnm (k + l)m 



(& + l)m 



Since <9[ l I / i o r*] G Ao, the discrete rate function {Af'I'i o r*](A;m)}fc is equivalent to the discrete 
rate function {Sf^i o r*](km)}k which is in the class Ao defined e.g. in [31]. (3.8) now follows 
from [31, Theorem 4.3]. 



A. 7 Proof of Theorem 3.11 

Since V € T>(A), there exists an increasing sequence T n j oo of J^-stopping times such that 
for any n, t t— > V(X t /\T n ) — V(Xq) — j Q " AV(X s )ds is a P x -martingale. Denote a + = a V 0. 
We have (AV) + (x) < bl c (x) and thus E x (/ tATn (AV) + (X s ))ds < oo which ensures that the 
quantity E x (J^ ATn AV(X s ))ds is well defined. This implies that 

< E x (V(X tATn )) = V(x) +E X I J AV(X s )ds \ < V(x) + bE x I J l c (X s )ds \ < oo. 
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This allows to write 



-tATn 

E x (V(X tATn )) + E x ( J 0o 



tAT n 

F^+E^f / [AV(X s ) + (ftoV(X s )]ds 

rtAT„ \ 

( / l C (Xs)dsj . 



The previous inequality ensures in particular, by monotone convergence theorem, that E x ( J * (ft o V(X s )d. 
oo. The proof is now completed by noting that 

E x (V(X t )) = E^liminf V(X tATn )) < lim inf E x (V{X tAT ) ) 

n n 

< liminf |F(x) - E x (/ o F(X s )dsJ + bE x (J l C (X s )ds 
= V{x) - E x (J* (ft o V(X s )ds^j + bE x (J l c (X s )ds^j 
where the last equality follows from monotone convergence. 

A.8 Proof of Theorem 3.12 

We first prove (i). It is straightforward that since C is petite for the resolvent kernel, it is also 
petite for the Markov process associated to the semi group Pt- Now, by definition, we have 

poo pu 

E x (R p V{X u )) = / f3e-^ v P v+u (x,V)dv = e Pu RpV{x) - e? u / pe-P v P v (x, V)dv . (A.ll) 

J JO 

This implies that 

Ex f f 0{R P V{X U ) - V{X u ))du 



(5e f3u R p V(x)du 



Pe~P v P v (x, V)dv )du- (3 / P u (x, V)du 



(eP' - l)RpV(x) - (J 0eP u du) e~P v P v (x, V)dv - P u {x, V)du 

(eP' - l)RpV{x) - eP' f 0e-P v P v {x, V)dv = E X {R„V(X S )) - R p V{x) (A.12) 

J o 



Moreover, if D(C, V, (ft, b, (3) holds then, 

E, P(Rf,V(X u ) - V{X u ))d^j <-E x jT f3(ftoV{X u )du +(3bE x J l C {X u )du 



(A.13) 
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Combining (A. 12) and (A. 13) yields (i). Now, consider (ii). By [21, Theorem 2.3 (i) and 
Proposition 4.4 (ii)] and Theorem 3.1, there exist positive constants 5, c\ and c 2 such that for 
any x G X, 



E, 



.fc=i 



< G c (x, <p o V, 1; 5) + ci sup G c (x, c/)oV,l;S)< V(x) + c 2 



where (Afc)fc is a Markov chain with transition kernel Rg, ?c = inf{fc > 1 : G C} and E x 
is the expectation associated to the probability induced by the Markov chain (Xk)k- Write 
W(x) = E x ^X]fc£o^° V(Xk)j an d fix e > small enough so that < sup M>1 <f)(u) — eu < oo. 
This implies that there exists some constant c such that 



W(x) < (l + e)V(x) +c, 



Let C = {x G X : W(a;) < sup^ o V + A} where A is a positive number such that (sup c <f> o 
V + A - c)/(l + e) > 1. Note that C C C since if x G C, W(a:) = o V(i) < sup c </> o F and 
thus, x £ C. This implies that for all x g" C, 



fl^W(x) = W(a?) - o V(x) < W(x) -<po W(x) , 

with (p is a non decreasing differentiable concave function such that 4>(u) 
sup c <j) o V + A. Moreover, for all x G C, 



(A.14) 



f=f ] for u > 



fljgWXa;) - W(a:) + o W(x) < sup < E a 



TO 



jfc=l 



+ 4>oV{x) 



<sup{y(x) + c 2 + (/>oy(x)}. (A.15) 

c 



Since <fi o V < If on X, V and <fi o V are finite on (7. By (A.14) and (A.15), there exists a 
constant 6 such that for all x G X, 



oW + bl£. 



Moreover, we have by straightforward algebra hm^ r^(t)[r^((l + e)t)\ 1 = 1 + e. It remains to 
check that C is petite w.r.t. Rq. Since C is included in some set {V < n} which is petite w.r.t. 
the semi group Pf, we have that C is petite w.r.t. the semi group Pt which implies by [21, 
Proposition 3.2] that C is petite w.r.t the Markov transition kernel Rp. The proof is completed. 

A. 9 Proof of Theorem 5.1 



(i) We first prove that P m W < W — <j> o W + b'lc- This a consequence of Proposition 3.8 
and Theorem 14.2.3 (ii) in Meyn-Tweedie [22]. Indeed, since sup c V < oo, (i) shows that 
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swp xeC E x Yll=o 1 (f>°V{Jt km ) < oo. uenne a m , c 

E x [X^fc=o° ( t ) ° y(-^fcm)] • Then the function W satisfies the conditions (see [22, Chapter 14]). 
As discussed in the proof of Theorem 3.10, for all n > n* the level sets {V < n} are accessible 
and petite for the skeleton chain P m . As a consequence, either sup^ V < n* and we may replace 
C by {V < n*} in the previous drift inequality, or sup c V > n* and we choose C = C. 
(ii) The Moderate deviations principle (or MDP) comes from a decomposition into blocks and 
a return to the discrete time case. Assume that m = 1 which can be done without loss of 
generality. In fact, by (i), the Markov chain (Hj. := A[ fc fc+1 [)fc g N with probability transition Q is 
subgeometrically ergodic with the invariant probability measure tt = P^l^ and satisfies A1-A2 
in the terminology of Douc-Guillin-Moulines [5]. Then, we may write (denoting the integer part 

by L-J) 



< oo. Define <x 



inf{fe > 0, X mk £ C} and set W(x) 



1 



vmL 9{Xs,Ms 



h(e) 



g(X s )ds 



o 



Wehl f k+1 



He) 



Lt/ej-l 



ff (X s )ds + -^- / 5 (A s )ds 



J\t/e\ 

t/e 



where G is obviously a bounded mapping with values in W 1 . By the boundedness of g, it is easy 
to see that the second term is exponentially negligible in the sense of moderate deviations, and 
thus 5| and X)L=o 1 ^O^fc) are exponentially equivalent, and share the same MDP. 



Note now that by Theorem 7 of Douc-Guillin-Moulines [5], under the subgeometric ergodicity 

k) s - 1 h?(e) 



of (Hfc) and the condition on the speed, l^L=o 1 ^C^fe) satisfies a MDP with speed 



and rate function 



r i 



o ceK d 



1 



sup < 7 (t), C) - -a 2 ((G, 0) rfi if d7(t)=7(t)cft, 7(0)=0, 



L +oo 



else, 



where 



/n-1 

a 2 ((G,C))= lim -E* VG(S 

n— >oo ?7, i ' 



On the other hand, by the subexponential ergodicity, the boundedness of g and M n {g,0 = 0, 
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we have that f °°(Pt(g,Q ~ ^((fisC)))^ is absolutely convergent in L l (ir). Thus 

a 2 ((G,0) = hm -E n ( T g(X s )ds) 

= hm —E n ( ^ ds [ S (g, ()P u (g, ()du 

= 2 / {g,C) / P u (g,Odudn 
Jx Jo 



^«S,C», 



and then 1% = 1%. 
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